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Abstract: A methodical analysis of the research related to the ar- 
ticle, "Sur les groupes continus ", of Henri Poincare reveals many 
historical misconceptions and inaccuracies regarding his contribu- 
tion to Lie theory. A thorough reading of this article confirms 
the precedence of his discovery of many important concepts, espe- 
cially that of the universal enveloping algebra of a Lie algebra over 
the real or complex field, and the canonical map (symmetrization) 
of the symmetric algebra onto the universal enveloping algebra. 
The essential part of this article consists of a detailed discussion 
of his rigorous, complete, and enlightening proof of the so-called 
Birkhoff-Witt theorem. 



TITRE: La demonstration de Poincare du theoreme de Birkhoff-Witt. 

RESUME: Une analyse methodique des travaux faits en connexion avec I'article, 
"Sur les groupes continus", de Henri Poincare revele des erreurs historiques et 
des jugements injustes en ce qui concerne sa contribution a la theorie de Lie. 
Une etude approfondie de cet article confirme la precedence de sa decouverte de 
plusieurs concepts importants; notamment de I'algebre enveloppante universelle 
d'une algebre de Lie sur le corps reel ou le corps complexe, et de I'application 
canonique {la symetrisation) de I'algebre symctrique sur I'algebre enveloppante 
universelle. L'essentiel de cet article consiste en un examen approfondi de sa 
demonstration rigoureuse et complete du theoreme de Birkhoff-Witt. 

Liste de quelques mots-cles: L'algebre enveloppante universelle, I'application 
canonique (symetrisation), le theoreme de Birkhoff-Witt. 
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1. Introduction 



In our research on the universal enveloping algebras of certain infinite-dimensional 
Lie algebras we were led to study in detail the original proofs of the so-called 
Birkhoff-Witt theorem (more recently, Poincare-Birkhoff-Witt theorem). This, in 
turn, led us to the investigation of Poincare's contribution to Lie theory (i.e., the 
theory of Lie groups. Lie algebras, and their representations). To our great surprise 
we discovered many historical misconceptions and inaccuracies, even in some of 
the classics written by the leading authorities on the subject. This discovery has 
puzzled us for some time, and we have sought the opinions of several experts in the 
field. Their answers together with our thorough reading of several original articles 
on the subject shed some light on this mystery. We were astounded to find out that 
Poincare was given neither credit for his fundamental discovery of the universal 
enveloping algebra of a Lie algebra over a field of characteristic zero, nor for his 
introduction of the symmetrization map, and only a cursory and belated acknowl- 
edgment of his contribution to the so-called Birkhojf- Witt theorem, of which he gave 
a rigorous, complete, beautiful, and enlightening proof. Indeed, in two of the most 
exhaustive treatis es on universal enveloping algebra [ Cohn 1981 ] and | Dixmier 1974| 
Poincare's work [Poincare 1900| was not mentioned. In many authoritative text- 
books treati ng Lie theory suc h as ||C he valley 1955 , [ Cartan &: Eilenberg 1956 |, 



Kuros 19631 [jjacobson 1962| , [Varadarajan 1984 (197"4)1[ [ [Humphreys 1972| , 
Knapp 198£[,. . . , Poincare's discovery of the universal enveloping algebra and 
the symmetrization map was ignored. In some books his name was left off the 



Birkhoff-Witt theorem, and his fundamental article [Poincare 190C[ was not even 
quoted. In Encyclopaedia of Mathematics [Encyclopaedia 1988-1994 under the 
rubric "Birkhoff-Witt theorem" it was written ". . . The first variant of this theorem 
was obtained by H. Poincare; the theorem was subsequently completely demon- 
strated by E. Witt [1937] and G. D. BirkhoflP] [19 37]. . .". Clearly the author, T.S. 
Fofanova, did not read carefully [ Poincare 1900[ ; otherwise she would have real- 
ized that Poincare had discovered and completely demonstrated this theorem at 
least thirty-seven years before Witt and Birkhoff. Why such iniquities can hap- 
pen to one of the greatest mathematicians of all times who published these results 



Poincare 190C[ in one of the most prestigious scientific journals. Trans. Camh. Phi- 



los. Soc, on the occasion of the jubilee of another great mathematician. Sir George 
Gabriel Stokes, is a most interesting mystery that we shall attempt to elucidate in 
this article. But before beginning our investigation we want to make it clear that 
our intention is to study thoroughly one of the most fundamental discoveries by one 
of the greatest minds in order to understand how important ideas are created, and 
not to rectify such iniquities, for such a task is doomed to fail as the force of habit 
always prevails; a fact very perspicuously expressed in the following excerpt from 
Gittleman 1975 , p. 186], ". . . rHospital's rule, Maclaurin's series, Cramer's rule, 
RoUe's theorem, and Taylor's series are familiar terms to calculus students. Actu- 
ally, only one of these five mathematicians was the original discoverer of the result 
attributed to him, and that man was RoUe. The person who popularizes a result 



^Actually Garrett Birkhoflf (1911-1996), not G.D. (Birkhoff) which are the initials of George 
David Birkhoff (1884-1944), the father of Garrett. This inaccuracy only occurs in the translation, 
not in the original (Russian) version of the Encyclopaedia. We are grateful to Professor Sergei 
Silvestrov for elucidating this fact to us. 
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generally has his name attached to it, although later it may be learned that some- 
one else had originally discovered the same result. For practical purposes names are 
not changed, but even so, the mistakes seem to compensate for one another. Al- 
though Maclaurin was credited with a series he did not discover, a rule which he did 
originate is now known as Cramer's rule. . . " . Besides, Poincare is a member of the 
elite group of mathematicians to whom many important mathematical discoveries 
are attributed; indeed, in Encyclopaedia of Mathematics [ Encyclopaedia 1988-199^ 
18 rubrics are listed under his name. Curiously, under the heading "Poincare last 
theorem" the editorial comments state that "[this theorem] is also known as the 
Poincare-Birkhoff fixed-point theorem," and the author, M.I. Voitsekhovskii, wrote 
". . .it was proved by him in a series of particular cases but he did not, however, 
obtain a general proof of this theorem."^ Misnaming mistakes seem to compensate 
one another after all. 



In his book [ Bell 1937 |, E.T. Bell, who called Poincare "the Last Universalist" , 
considered the Last Geometric Theorem as Poincare's "unfinished symphony" and 
wrote ". . . And it may be noted that Poincare turned his universality to magnifi- 
cent use in disclosing hitherto unsuspected connections between distant branches of 
mathematics, for example, between continuous groups and linear algebra" . This is 
exactly the impression we had when reading his article, "Sur les groupes continus" . 

2. Poincare's work on Lie groups 

To assess Poin care' s contribution to Lie theory in general we use two main sources 
|Poincare 1916- [ and [ Poincare in mcmoriam 1921 1 and investigate in depth the ref- 
erences cited therein. We start with the article, "Analyse des travaux scientifiques 
de He nri Poincare, faite par lui-me me 'j^ which was written by Poincare himself in 
1901 ([ Poincare in mcmoriam 1921, 3-135] ) at the request of G. Mittag-Leffler (cf. 
"Au lecteur" [ Poincare in mcmoriam 192l[ , 1-2]). It is part of vol. 38 of the jour- 
nal. Acta Math., published in 1921 in memory of Henri Poincare. (Actually, most 
of vol. 39 published in 1923 is also devoted to Poincare's work). In the third part 
of the above-mentioned article. Section XII (Algebre) and Section XIII (Groupes 

■^Voitsekhovskii continues, "The paper was sent by Poincare to an Italian journal two weeks 
before his death, and the author expressed his conviction, in an accompanying letter to the editor, 
of the validity of the theorem in the general case." Indeed, on December 9, 1911, having some 
presentiments that he might not live long, Poincare wrote a moving letter to Guccia. directo r and 
founder of the journal Rendiconti del Circolo Matematico di Palermo (cf. [Poincare 1916- , t. 2, 
p. LXVII]), ask ing his opinion regar ding what has become known as "Poincare's Last Geometric 
Theorem" (see [Barrow-Green 1997, §7.4.2, 169-174], for an English translation of the letter and 
an excellent discussion of the theorem). Mr. Guccia readily accepted the memoir for publication 
and it appeared on March 10, 1912, just a few months before Poincare's death on July 17, 1912 
(Sur un theoreme de Geometric, Rendiconti del Circolo Matematico di Palermo, 33, 375-407 
= Oeuvres VII, 499—538). Ultimately it was G.D. Birkhoff who gave a complete proof of this 
theorem (Proof of Poincare's geometric theorem, Trans. Amer. Math. Soc, 14 (1913), pp. 14— 
22 = Collected Mathematical Papers I, 673-681) and of its generalization to n dimensions (Une 
generalisation a n dimensions du dernier theoreme de geometric de Poincare, CR. Acad. Sci. 
Paris, 1 92 (W31). dp. 196-198 = Collected Mathematical Papers II, 395-397). 

''In [ Poincare 1916-) this article is listed as published by Acta Math., 30 (1913), pp. 90— 
92. In fact, it never existed as such; the editors of Poincare's collected work probably found the 
manuscript of the article among his papers with his annotations rega rding the journal and the dat e 
of publication but due to World War I it appeared eventually in [Poincare in mcmoriam 1921 [. 
This remark extends to all di screpancies between the inten ded and actual dates of publication of 
many of Poincare's works in Poincare in mcmoriam 192 1| , for example, Rapport sur les travaux 
de M. Cartan. 
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Continus) are devoted to his contribution to Lie theory. ActuaUy, we think that 
under Poincare's impetus finite-dimensional continuous groups are eventually called 
Lie groups. Indeed, Poincare expressed repeatedly his great admiration for Lie's 
work in [ Poincare 1899[ and [ Poincare 190C[ and wrote in Rapport sur les travaux de 
M. Cartan ([Poincare in memoriam 1921, 137-145], with a curious footnote: Acta 
Math., 38, printed on August 11, 1914): "Je commencerai par les groupes continus 
et finis, qui ont ete introduits par Lie dans la science; le savant norvegien a fait 
connaitre les principes fondamentaux de la theorie, et il a montre en particulier que 
la structure de ces groupes depend d'un certain nombre de constantes qu'il designe 
par la lettre c affectee d'un triple indice et entre lesquelles il doit y avoir certaines 
relations. . . une des plus importantes applications des groupes de Lie. . . ". So far 
as we know this is the first time that the name Lie groups was explicitly mentioned. 

Poincare's first encounter with Lie theory probably dated back to his article 
[Poincare 1881 [ and its generalization | Poincare 1883t . The problem he considered 
there can be phrased in modern language as follows: 

For X — {xi, . . . ,Xn) G C" let GL„(C), the general linear group of all n x n 
invertible complex matrices, act on C" via {X,g) Xg, g e GL„(C). Let F{X) 
denote a homogeneous form of degree m (i.e., a homogeneous polynomial of degree 
m in n variables {xi, ... ,Xn)), find the subgroup G of GL„(C) which preserves 
the form F; i.e., F{Xg) — F{X),yg e G. Conversely, given a subgroup G of 
GL„(C) find all homogeneous forms that are left inva riant by G. This is precisely 
the pr oblem of polyn omial invariants (cf. Weyl 1946) ). 

In [[Poincare I881I and [^ oincare 1883| he found all cubic ternary (of three vari- 
ables) and quaternary (of four variables) forms that are preserved by certain Abelian 
groups (which he called "faisceau de substitutions" ) , and he also extended this re- 
sult to the non- Abelian case. Conversely, he exhibited explicit groups that preserve 



quadratic and cubic ternary and quaternary forms. For example, in [Poincare 1881 
239-241] he found the subgroup of the unipotent group 





a 








1 


n 


;a,/3,7ecj 












which preserves the quadratic form 
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2B1X2X3 + 2B2X1X3 + 2B3X1X2, A,,Bi e C;l < i < 3. 



Conversely, he showed that all quadratic forms which are left invariant by the full 
unipotent group defined above must satisfy a certain partial differential equation. 
Lie theory also plays an important role in Poincare's work on the conformal rep- 
resentation of functions of two variables which leads to the theory of relativity. In 
[Poincare 190(: [ and [ Poincare 1912| he studied the group of linear transformations 



y 



t invariant which he called the 



which leave the Minkowski's metric x"^ 
homogeneous Lorentz group, or as H.A. Lorentz wrote in [Lorentz 1921 [ "groupe 
de relativite" , and discovered the Poincare group which is the semidirect product 
of the four-dimensional translation group with the homogeneous Lorentz group. 

In his analysis of his scientific accomplishments Poincare classified his work in 
seven topics which range from "Differential Equations" to "Philosophy of Science" . 
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His accomplishments in any single one of these areas would already make him fa- 
mous. Indeed Sir George H. Darwin (1845-1912), a physicist and son of the famous 
Charles Darwin (1809-1882), wrote in 1909 that Poincare's celestial mechanics 
would be a vast mine for researchers for half a century (| Boyer 1968t p. 652]). Un- 
der rubric number three "Questions diverses de Mathematiques pures", Algebra, 
Arithmetic, Group theory, and Analysis Situs (combinatorial topology) are listed 
together, with Lie groups as a subsection of Group theory. This gives a false impres- 
sion that he had only a slight interest in the subject. In fact with the exception of his 



first article on continuous groups [ Poincare 1899| the other three articles are quite 
long: [ Poincare 1900{ (35 pages), [ Poincare 1901| (47 pages) and [ Poincare 1908| 
(60 pages). In all these articles he not only conveyed to the reader his keen interest 
in the subject but also some of the difficulties that preoccupied him over a ten-year 
period. 



It was Lie's third theorem that motivated Poincare to write [ Poincare 1899 [. 
This theorem can be stated in his notation^ as follows: 

If {Xi, . . . ,Xr} is a system of infinitesimal transformations (i.e., vector fields) 
which satisfy the equation 



(2.1) 



^iksXs 



then the structure constants Ciks must satisfy the relations 

(2.2) Cfiis — ^iks: 



(2.3) ^^{ciksCjik + ciksCijk + CjksCiik) = i^<iJJ,s<r), 

k=l 

which follow immediately from the fact that the bracket [ • , • ] is skew symmetric, 
and the Jacobi identity 

[[Xa^XblX,] + [[Xb,X,lXa] + [[X,,XalXb] = 0. 



Conversely, if the coefficients Ciks satisfy Eqs. ( |2.2| ) and (2.3) then there exists a 
system of infinitesimal transformations verifying Eq. (2.1), and hence a group of 
transformations with r parameters. 

In [Poincare 1900 he gave a different proof of this theorem, especially for the 
case of Lie algebras with non-trivial centers. His approach consists of reformu- 
lating Lie's construction of the adjoint group, deriving the differential equations 
associated with the parametric group, and then showing that these equations can 
be integrated. More specifically, if Xi, . . . , Xr form a basis of a Lie algebra of 
infinitesimal transformations, and 

then one has the adjoint representation 



^Poincare used parentheses instead of brackets for the commutator products. To avoid confu- 
sion we replace the parentheses with the more conventional brackets. He also used the notation 
Ciks instead of the more convenient notation c'f, for the structure constants. We do not how- 
ever replace this notation, which docs not cause any confusion, to preserve as much as possible 
Poincare's style and terminology. 
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where T' — J2 t'i^i and e*"^ = X^^^o "^^^ image of the adjoint representation 

is then caUed the adjoint group. By setting 



V T 

e e 



where W — WiXi 



it follows that the w are functions of the 



V and t, or, in other words the trans- 
formation transforms into , i.e., the v into the w; and the group thus 
defined in r variables is called the parametric group associated with the system 
{Xi, . . . ,Xr}- In jPoincare 1901| and [Poincare 1908 he studied in great detail 
these differential equations and the isomorphism between the adjoint and para- 
metric groups. His research into expressions of as a function of U and V in the 
formula e^e^ — resulted in a precursory form of the Baker-Campbell- Hausdorff 



formula (see [ Schmid 1982 for a discussion re garding Poi ncare's contribu tion to this 
theorem and, e.g., Varadarajan 1984 (1974) , p. 114] or [ Bourbaki 1972 , Chap. II], 



for a more modern proof of this theorem). 



3. Poincare's discovery of the universal enveloping algebra and the 
so-called birkhoff-witt theorem 

In this section we shall expound the main theme of this article, namely, Poincare's 
precedence in the discovery of the universal enveloping algebra and the so-called 
Birkhoff-Witt theorem. For this purpose we shall examine in detail his article 
Poincare 190C]. As a general rule we try to adhere faithfully to his exposition, 
notation, and style as much as possible. But in order to make our point we shall 
insert some comments, prove some claims which Poincare considered self-evident 
but did not seem to be so obvious to us, and integrate his work in the more mod- 
ern framework of Lie theory. At first reading jPoincare 190C] seems to be hastily 
written, repetitive, and sometimes cryptic, and this might explain why not many 
people have read it; especially for the readers to whom French is not their first 



language. But by a careful analysis of [Poincare 1900] one must conclude without 
a shade of doubt that Poincare had discovered the concept of the universal algebra 
of a Lie algebra and gave a complete and rigorous proof of the so-called Birkhoff- 
Witt theorem. As we shall see, his entire proof of this theorem, with the exception 



of the claim that we will state as Theorem 5.3, is quite rigorous and modern in 



language. For these reasons we will translate the parts in jPoincare 190C] that are 



relevant to our discussion for the benefit of the readers who are not familiar with 
French. But before going into the details we shall elaborate on why we consider his 
proof very enlightening. For example, his introduction of the symmetrization map 
is quite natural by observing that the most elementary "regular" (or "symmetric" ) 
polynomials are the linear polynomials and their powers. And, as it turned out, all 
symmetrized polynomials are linear combinations of those. From the symmetriza- 
tion comes out naturally the notion of equipollence which, in turn, leads to the 
notion of equivalence^ and ultimately to the definition of the universal enveloping 
algebra of a Lie algebra. In the proof of the fact stated as Theorem below, 
he introduced the notion of "chains" and cleverly showed that he can add more 
chains to paradoxically reduce the number of basic chains, and this enables him to 
proceed by induction (on the degree of the regular polynomials). This ingenious 
idea foreshadows some techniques used in the modern theory of word problem. 

For the remainder of this section, in order to capture Poincare's vivid flow of 
ideas we shall use the present tense to present his exposition. For convenience, we 
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shall discuss the universal enveloping algebra first. Let Xi, . . . , Xn be n elementary 
operators (Poincare thinks of these operators as vector fields but never really uses 
this fact here). Let C be the Lie algebra over a field of characteristic zero K 
(Poincare thinks of K as M or C but all concepts and proofs remain identical) 
generated by these n elementary operators which constitute a basis for C. Let A 
denote the non-commutative algebra of polynomials in n variables Xi, . . . , Xn with 
coefficients in K. Consider the set of all elements of A of the form 



(3.1) 



P{XY ~YX ~ [X,Y])Q, 



where P and Q are arbitrary polynomials in A, and where [X, Y] denotes the 
bracket product of C. Define an equivalence relation ~ in ^ by declaring that an 
element ^4 S -4 is equivalent to if A is a linear combination of elements of the form 
( ^ ) for some P and Q in A, and A A' , A' £ A, ii A~ A' 0. Then the quotient 
algebra (or residue ring) thus defined is now called the universal enveloping algebra 
of C. In fact, this can be rephrased in modern language as follows: 

Let T denote the tensor algebra ov er the unde rlying vector space of £, then T 
is isomorphic to A (see, for example, [ Lang 1965 , Prop. 10, p. 423]). Let J denote 
the two-sided ideal of T generated by the tensors X^Y — Y®X~ [X, Y] where 
X,Y £ L. Then the as sociative algebr a lA = T/ J is called the u niversal enveloping 
algebra of £ (cf., e.g., jBourbaki 1975i p. 12] or fBourbaki 1960| p. 22]). Under the 
isomorphism between T and A the ideal J corresponds to the two-sided ideal of A 
spanned by all elements of the form (3.1). Actually this is exactly Harish-Chandra's 
approach to the universal enveloping algebra in |Harish- Chandra 1949|. Unaware 
that Poincare had defined this notion in [Poincare 1900 1, Harish-Chandra wrote 



the following footnote: " This algebra is the same as the one considered by Birkhoff 
[1937] and Witt [1937], though their method of construction is different". Indeed, 
as we mentioned earlier, Harish-Chandra's definition of the universal enveloping 
algebra is identical to Poincare's. He also wrote: "In view of this 1-1 correspondence 
between representations of C and U it is appropriate to call U the general enveloping 



algebra of £", and in his fundamental paper [Harish-Chandra 1951 1 on the role of 
the universal enveloping algebra of a semisimple Lie algebra in Lie theory, published 
two years later, he replaced the word "general" with "universal", probably under 
the influence of Birkhoff's work on universal algebras. Thus we can conclude that 
it was Harish-Chandra who named this algebra discovered by Poincare "universal 

enveloping algebra" . 

In his very influential book [ Chevalley 1955| , C. Chevalley, one of the world's 
leading experts in Lie theory and a founding member of Bourbaki, attributed to 
Harish-Chandra the following theorem: "There exists a one-to-one correspondence 
{but not multiplicativel) between elements of U and those of the symmetric algebra 
of C; also if C is the Lie algebra of a Lie group G, then 2A is isomorphic to the 
algebra of right {or left) invariant differential operators over the algebra of ana- 
lytic functions on G" (see Chapter 5, vol. Ill of [ Chevalley 1955 1, especially §6). 
Note that Chevalley and Harish-Chandra were colleagues at Columbia University 
during this period. We shall establish below that Poincare had already discovered 
this correspondence (usually called the canonical isomorphism or the symmetriza- 
tion map) and had defined hi as the algebra of right (or left) invariant differential 
operators on G. It is also interesting to note that Chevalley also gives a proof 
of the Birkhoff- Witt theorem |Chevalley l"955| vol. HI, Prop. 1, p. 163] without 
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mentioning the work of Birkhoff and Witt. Anyhow, many authors seem not to ac- 
knowledge Poincare's discovery of the fundamental notion of universal enveloping 
algebra; for example, in the encyclopaedic work [Dixmier 1974 1 Poincare's work is 
not even referred to with regard to this algebra. 

Finally we are coming to the main part of this article, namely, Poincare's proof 
of the Birkhoff- Witt theorem. But before going into detail about the proof, we 
shall make some historical remarks. As discussed earlier, none of the leading ex- 
perts in Lie theory seemed to be aware of the existence of Poincare's work on the 
universal enveloping algebra and his proof of the so-called Birkhoff- Witt theorem 
prior to about 1956. Garrett Birkhoff and Ernst Witt certainly didn't mention 
Poincare's work in | Birkhoff 1937 1 and |Witt 1937 , respectively, which both ap- 
peared in 1937. M. Lazard generalized in p^azard 1952 and jLazard 1954| this 
theorem, which he called the Witt theorem, but did mention [Birkhoff 1937 1 and 



the work of Kourotchkine [1951]. So far as we know the au thors who first noticed 
the proof of the Birkhoff- Witt theorem already appeared in [Poincare 190C|] were H. 
Cartan and S. Eilenberg, whose book [ Cartan fc Eilenbcrg 1956 1 appeared in 1956. 
Curiously, they called the theorem the Poincare- Witt theorem and did not refer 
to Birkhoff's proof; moreover, they attributed the complete proof of the theorem 
to Witt. It appears that Bourbaki was th e first to call this theorem the Poincare- 
Birkhoff-Witt theorem in | Bourbaki 196C ] , a recognition acknowledged in arguably 



the most influential book on Lie algebras in the English language [Jacobson 1962 



From then on this is the most prevalent name used for this theorem; however, many 
authors of serious books on Lie theory such as [ Kuros 1963| |, |Cohn 1981 1 and more 
recently |Knapp 1986 1, etc., continue to call it the Birkhoff- Witt theorem. 

Now let us carefully examine | Poincare 190C|, especially the portion relevant to 
our investigation, pp. 224-232. 

The section heading is "Calcul des polynomes symboliques" . Let AT, Y, Z,T,U, . . . . 
be n elementary operators (i.e., a basis for a Lie algebra over a commutative 
field K of characteristic zero). Consider the algebra of symbolic {or formal) non- 
commutative polynomials in these operators with coefficients in K. Then as previ- 
ously mentioned, we may identify this algebra with the tensor algebra T. 

Definition 3.1. Two monomials are said to be equipollent if they differ only by 
the order of their factors. This definition extends obviously to two polynomials 
that are sums of pairwise equipollent monomials. Ex. XY^, YXY, and Y^X are 
equipollent monomials, and 3XY^ + 3YZ^ + 3ZX^ and XY^ + YXY + Y^X + 
YZ'^ + ZYZ + Z'^Y + + XZX + X^Z are equipollent polynomials. 

Definition 3.2. A polynomial is said to be regular (or normal) if it can be ex- 
pressed as a linear combination of powers of the form 

(3.2) {aX + pY + -fZ + ...)P, pe N, a, /3, 7 G K. 

Poincare then makes several statements without bothering to prove them. (They 
must seem to be obvious to him; note that the same statements are made in 
[Poincare 1^ 



which is an abridged version of [Poincare 1900 , where regular 



polynomials are called normal.) However, because of the importance of their im- 
plications we shall formulate these statements as a theorem and provide the reader 
with a proof which seems to be quite long, but we do not see how to shorten it 
(we suspect that because of these claims some authors did not consider Poincare's 
proof rigorous). 
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Theorem 3.3. (i) A necessary and sufficient condition for a polynomial to be reg- 
ular is that if it contains among its terms a certain m,onomial then it must contain 
all monomials equipollent to that monomial and with the same coefficient. 

(ii) Among all polynomials equipollent to a given polynomial there exists one and 
only one regular polynomial. 

Some preparatory work is needed for the proof of this theorem. 

Let P = P{xi,... ,Xn) denote the commutative algebra of polynomials in n 
indeterminates xi,. . . ,Xn, with coefficients in the field K. For an integer m > 
let 7-"" = V"^{xi,... ,Xn) denote the subspace of all homogeneous polynomials 
of degree m. If (a) = (ai, ... ,q;„) is a multi-index of non- negative integers set 
j,{a) _ ^ai . . .^a„ | ^q,-) | _ J2i=i "i- Then it is clear that the set {a;^"^} where 
(a) ranges over all multi-indices such that \ {cy.)\ = m forms a basis for V"^. Let A = 
A{Xi^ . . . ,Xn) denote the algebra of non-commutative polynomials in Xi, . . . 
then A is obviously graded. Let A^ = A'^{Xi, . . . , X„) denote the subspace of all 
homogeneous elements of A of degree to. Define the symmetrization map (Poincare 
does not formulate this map explicitly, but it is obvious from the context that he 
must have it in mind) : P"* A™ as follows: 

For 1 < J < ai let = Xi, for ai + 1 < j < qi + 0:3 set Xj = X2, for 
ai+ a2 + l < j < al^- a2 + as, set = X3, . . . , for ai + • • • + q;„_i + 1 < j < to 
set X'j = Xm with the convention that whenever = then the term Xi does not 
appear. Let 

(3.3) <l>„^(x(")) = Sym(X(")) = ^ ^ ^-(D ' ' ' ^-(-) 

and extend by linearity on afi elements of "P™ (note that = X^^ ■ ■ ■ X«"). For 
example, with (a) = (1, 2, 0, . . . , 0) and to = 3, then a;(") = xix^, = 
Xi = Xu X'2 = X2, X!, = X2, and 

(3.4) Sym(X(«)) = ^{X[X!,X;, + X[X^X^ + X'^X'^X'^ 

+ X-i^'i^'z + X2X^X[ + X^X[X2) 
= —{X1X2 + X1X2 + X2X1 + X2X1X2 + X2X1 + X2X-1X2) 
= ^{X^Xi + X2X1X2 + XlXi). 

Now the dimension of 7-"" is precisely the number of ways a n-tuple of integers 
(«!,... ,a„) can be chosen so that |(q:)| = to. If dm denotes this number then 
a combinatorial formula gives dm = ("^m^)- Order this set of multi-indices 
following the reverse lexicographic ordering as follows: 

(3.5) "(a) ^ (/3) if for some k,l <k <n,ak < (3^, 
and ttfe+i = /3fc+i, ... , a„ = /3„, 

and (a) ^ {(3) if either (a) -< (/3) or (a) = (/?)". 

Then < is obviously a total ordering; for example, (to,, 0, . . . , 0) ^ {m—\, 1, 0, . . . , 0) ^ 
(to— 3, 3, 0, . . . , 0), and (to, 0, . . . , 0) is the first (the least) element and (0, 0, . . . , 0, to) 
is the last (the largest) element under this ordering. If p is an element of V™" of the 
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form 

p = {ciXi + • • • + c„a;„)"\ where Ci G K, 1 < z < n, 

then clearly 



(Q)eA„ 



where the multinomial coefRcient is equal to — — r, and c^"-' — c?^ • • -c"". 

If we denote by X^"' the image of by (i.e., = Sym(X°)), then by 

linearity 



(a)eA„ 



On the other hand an easy computation shows that 

(3.6) i] = sum of aU distinct elements of A"" equipollent to X^"). 



By expanding {ciXi + • • • + c„X„)'" and taking into account the non-commutativity 
of the products of the Xi we see that 

(3.7) (ciXi + --- + c„X„)™ 

= c^"-*(sum of all distinct elements oi A"^ equipollent to X*-"-*) 

(a)eA„ 

= E (:)^^"^^^"^- 

(a)eA„ ^ ^ 

Thus ^^[[ciXi + ■■■+ CnXnT) = (ci^i + • • • + C„X„)™. 

It follows by linearity that the image of a regular polynomial in 7"" is the regular of 
AJ^ ^ obtained by substituting the variable Xi by the variable X;; moreover, regular 
elements of A™ are already symmetrized. Let ^ : V ^ A denote the linear map 
obtained by setting 



(3.8) <^ip) = J2 



m>0 



where p is decomposed into homogeneous elements as p = X]m>oP"i- 

If X](a)gA^ ^(a)^'""^ is an arbitrary element of V"^{X(a) G ]K,V(q;) G A^), then 



from Eqs. (|3.3D and (3.6) it follows that 



\(a)eA„ / (a)eA,„ 

= ^ ^(a)7T^ (sum of all distinct elements of yl'" equipollent to X*-"^). 
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Since the non-ordered monomials of degree m (i.e., the set of aU distinct elements 
of A"' equipollent to for aU (a) e A„) fo rm a basis for A"\ it follows that if 

V(a)eA„ / 

then A(„) = for all (a) e A,„. Thus (f>,„ is one-to-one, and it follows that <i> 
is a monomorphism of vector spaces (but not an algebra homomorphism) . Let 
TZ = TZ{Xi,... ,Xn) denote the subs pac e of all regular polynomials of A; then 
among the consequences of Theorem ^.3| one can infer that ^ is a vector space 
isomorphism of the vector space V onto the vector space TZ, and moreover, regular 
elements in A are already symmetrized. 

Lemma 3.4. For any positive integer r there exists an n-tuple (c) = (ci, . . . ,c„) 
of positive integers such that 

(c)('3) > rc("' whenever (a) -< (/3) V(a), (/3) S A^. 

Proof. Given r G N* choose ci = 1 and define Ck inductively by Ck — r(ck-i)"^ 
for 2 < k < n. Then clearly ci < C2 < • • • < c„ and Ck — r(cfc-i)™ > 
rc""^C2^ '''Cfe^i^ since ai + ■■■ + an = m. Hence if (a) ~< {13), i.e., au < (i^ 
and ak+i = Pk+i^ ■■■ i = /3„ then 



— H ^k^-k ^k+1 Si 



> (rcr • • • C-) (ccr • • ■ CO = 



Lemma 3.5. The polynomials X*-"-* = $m(a;^"') = Sym(A"") are regular for all 
(a) e A^. 

Proof. We prove by induction that for each (/3) e A,„ the following statement holds: 
(3.9) "For every (a) ^ (/3), X*-"-* caK be expressed as 

•'{a) (a)(7) ' 

(7)^(/3) 

where /(fj** i regular element of A"^ 

and the constants X^^l, ^are rational numbers". 

(a)(7) 

First observe that if 

(/3) = (0,... ,0, m, 0,... ,0), 

T 

ith slot 

then x^^) = and <I>m(a;(^') = X^'^'^ = AT™, which is by definition regular. Thus 
the first element in this reverse lexicographical ordering is (/3) = (m, 0, . . . ,0) and 



the statement (|j|) holds trivially with = X^", where 

r{m,0.... ,0) _ ym 
J{m,0,... ,0) ^ "^1 



and 



A 



S::o::::;o),(7) = o^(^)^(-'0'---'0)- 
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Now assume the statement holds for Let (/?') denote the immediate successor 
to and consider the element e A"^ of the form 



= {ciXi + ■■■ + c„X„)™ = 



X ^ / m 



^ V(a) 



where the n-tuple (c) = (ci, . . . , c„) is yet to be determined. This implies that 



TO 



(")r<(/3) ^ ^ ^ (7)^(/3') ^ 



/,From (3.9) it follows that 



(3.10) 



(/?') 



E 

(7)^(/3') 



(")d(/3) 



(")r<(/3) (7))-(/3) 



(a) / 



E 

(7)^(/3') 



Since (/3') is right after (/?), the multi-indices (7) ;^ (/3) consist of (7) = (/?') and 
(7) iP')- Hence Eq. (3.10) can be written as 



= 5(/3') 



(")d(/3) 



E 

,(")r<(/3) 



(a) 



h>m I \("):<(/5) 



(a) 



(a)(7) 



(7) 
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This implies that 



(3.11) 



E 

("):<(/3) 



(a) J"^ ^i^W) 



E 

(a):<(/3) 



(a) r -^M 



(a) 



(a)(7) 



(7)^(/3') L \(o^)<{P) 

In Eq. (3.11) we can solve for X'^f^ ^ provided that 



(7) 



r;(7) l X'^-*. 



E 

(")r<(/3) 



(a) j"^ ^(")(^') 



is not zero. To insure this we now determine (c) = (ci, . . . , c„) as in Lemma 3.4 by 
choosing the integer r such that 



(3.12) 



r > dm max 

(a):<(/3) 



TO 

(a) 



(P) 



where dm = ^) the cardinality of A^. Then 



m 

(/3' 



JP') 



> 



(")d(/3) 



E 

(a)^(/3) 



(a) 



> 



E 

(«)^(/3) 



TO 

(a) 



(/3) 



It follows that 



TO 

iP') 



JP') 



E 

Q^)r<(/3) 



TO 

(a) 



J"),(/3) 

C ^(a)(/3') 



> 



TO 
(/?') 



E 

(a)r<(/3) 



m 



(/3) 



> 0. 



Thus we have shown that the n-tuple (c) — (ci, . . . , c„) of positive integers can be 
chosen so that the coefficient of X^^^ in Eq. ( |3.11 ) is a positive rational number, 
and the coefficients of X*-'''-' in the sum X](7);-(/3') rational numbers. Obviously, 



9m 



(")d(/3) 



is a regular polynomial, so by dividing both sides of Eq. (3.11) by the coefficient of 
X(f^ ) we can write 



(3.13) 



hXP') 
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where / 



I ^p,^ is regular and the constants ''^(^')(^) 
a di Eq. (3.£) can be written as 



(3.14) = /(f,' + 



(a) 
(/3) 



(")(/3 



A 



, 5,(/3) .(/3') 
(a) + •^(a)(/3') V) 



H •'\a)(7) 
(7)^(/3') 



are rational. For (a) ^ (/3'), i.e., 

(/3) ^(7) 



Set 



/•(/3),x(/3) f('3') „„H A^'^') 



from Eqs. (3.13) and (3.14) that for all (a) ^ {f3') 



>(/9') x(/9') 
\a)(/3')^(/3')(7) 



(/3') 
(/3')(7) 

(/3) 



+ A 



(/3) 

(a)(7) 



\a)(7) 



where Z^'^)'' is obviously regular and the coefhcients A 



(/3') 
(a)(7) 



are obviously rational. 



Hence we have completed the induction. Now for the proof of the lemma in the 
state ment ( ^.9[ ), choose (/3) = (/3)max = (0, . . . , 0, m) to be the last element of A™. 
Then (jsj) reads: 



"For every (a) ^ (/3)r 



(/3)„ 
(a) 



where /, 



. This is exactly what the lemma affirms. 



(/3)„ 
(a) 



is a regular element of 



^From the fact that $,„ : V"'' A"^ is a monomorphism it follows that the 
system {X'^"-\{a) e A,„} is linearly independent. Therefore if TZ™ denotes the 
subspace of TZ of all homogeneous non-commutative reg ular polyn omi als of degree 
TO in the indeterminatcs Xi, . . . ,Xn, then Lemma p.5| and Eq. (3/7) imply that 
the system {X", (a) G A„j} forms a basis for TZ"^ . It follows immediately from the 
discussion preceding Lemma 3.4 that $,„ : 7"" TZ™ is an isomorphism and hence, 
$ is an isomorphism of V onto TZ (clearly from Eq. (3.7) TZ™ C $m(7'™), Lemma 
PI shows that TZ™ = <^m{V™)). Now each X'-°'\ being a regular homogeneous 
polynomial of degree m, is therefore a linear combination of polynomials of the 
form {ciXi -I- • • • -|- CnXji)™- Let S be the set of such polynomials, then 5* is a finite 
set of vectors spanning the vector space TZ™. From a general fact in linear algebra 
(see, for example, [Hoffman & Kunze 1971, Corollary 2, p. 44]) we can deduce the 
following. 

Corollary 3.6. (i) The vector space TZ™ admits a basis consisting of vectors of the 
form 



(3.15) 



4^n) 



1 < i < dm, c* e 



(ii) The same conclusion holds with T'™ replacing TZ™ and Xj,l < j < n replacing 
X,_ 



Proof of Theorem 3.2. Clearly since is a graded algebra, it suffices to prove 
the theorem for A™,m > 0. 

(i) If / is a regular element of A™ then since {X'-"\ (a) e A„i} forms a basis for 
TZ™, 



/= ^(") 

(a)eA,„ 



X 



(a) 



A(a) e 
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Thus if A(q) ^ then since X'"^ contains all distinct monomials equipollent to X'") , 
therefore / contains all monomials equipollent to X*^") with the same coefficient 

Conversely, if / is a polynomial in A"^ which contains all monomials equipollent to 
a fixed monomial, which we may assume without loss of generality to be X^"^ with 
the same coefficient, then / must contain with a non-zero constant. 

Hence / is of the form 

(a)eA™ 

and therefore is a regular polynomial. 

(ii) First observe that if Xi-^ ■ ■ ■ Xi^ ,^ < ij < n,! < j < m, is a monomial in yl™, 
then it is equipollent to a unique monomial X^"^ = • • • X"" for some (a) € Am. 
Then from the definition of = Sym(X") and part (i) of this theorem, 
is the unique regular polynomial of A"^ that is equipollent to X^°'\ and hence 
to Xij...,;,^ (for example, X2X1X2 is equipollent to x^^'^'^'- - — X1X2, which 



from Eq. (3.4) is equipollent to the regular polynomial X^-^'^'^'--- '^"^ — i(XiX| 
X2X1X2 + XlXi) = i[(Xi + X2f + (Xi - ^2)3 - 2Xf]). Let 

U : ■ ■ ■ ,Un 

where the sum is over all distinct non-commutative homogeneous monomials of 
degree m and the coefRcients A^^...^^ are uniquely determined. Then since each 
Xi-^ ■ ■ ■ Xi^_^ is equipollent to a unique regular polynomial X^"^ for some (a) G A,„, 
p is equipollent to the unique regular polynomial 

(a)eA„ 

where fi/^\ is the sum of all for which A"ij...,;^ is equipollent to X^^y | 



Remark 3.7. It follows from Corollary 3.6(i) and Eq. (3.7) that a polynomial p of 



A is regular if and only if Sym(p) = p. Now define (as in [Godement 1982, 5.6.1]) 
a polynomial 



P 



= E ■ ■ ■ ^j™; ^ < ij < n,l < j < m, 



to be symmetric if all its coefficients Aij...,;^ are symmetric, i.e., Xi 



Aii...j„, for all a G Sm- Then Theorem 3.3(i) implies that a polynomial in A is 



(i)- 



regular if and only it is symmetric, since a monomial is equipollent to Xi-^ ■ ■ ■ Xi^ , 
if and only if it is of the form • • ■ for some a G Sm- Thus in this context 

regular is synonymous with symmetric, and this is probably what [ Bourbaki 1969| 



must have had in mind when he affirmed that Poincare gave a proof of algebraic 
nature that the associative algebra generated by the X;, 1 < i < n, has as basis 
certain symmetric functions in Xi. In fact, in [ Godement 1982| , 5.6.1], for example. 



this fact is used to define the vector space isomorphism (3 : S{q) ^(fl) of the 
symmetric algebra of polynomial functions on the dual of the Lie algebra q onto 
the universal enveloping algebra g. Obviously, V is isomorphic to S{q) and as we 
shall see TZ is isomorphic to U{q)\ thus the map (3 is basically $. 
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Now let US return to [ Poincare 1900[ . Let Xi, . . . , Xr be elementary operators 



(i.e., infinitesimal transformations) which form a basis for a Lie algebra C. Define 
the Lie bracket as 

(3.16) [X,Y]^ XY -YX; X,Y e C. 

Two polynomials in A are said to be equivalent if one can be reduced to the other 
in taking into account relation ( 3.16| ). 



For example, the product P{XY-YX-[X, Y])Q as defined in Eq. (|3j) (where 
the first and the last factors P and Q are two arbitrary monomials in A) is equivalent 
to zero, and obviously so are linear combinations of products of that form (i.e., P 
and Q may be taken to be polynomials). Products of the form ( |3.l[ ) are called 
trinomial products. 

The difference of two monomials which differ only by the order of two consecutive 
factors is equivalent to a polynomial of lesser degree. Indeed, let X and Y be those 
two consecutive factors. Then our monomials are written as 

PXYQ and PYXQ, 

P and Q being two arbitrary monomials, and their difference 

P{XY -YX)Q 

is equivalent to P[X, Y]Q, which has degree one less, since [X, Y] is of first degree, 
while XY and YX are of second degree. 

Now let M and M' be two arbitrary equipollent monomials; that is, they only 
differ by the order of their factors. One can find a sequence of monomials 

M,Mi,M2,... ,Mp,M', 

in which the first and the last terms are the given monomials and any term in the 
sequence differs only from the preceding by the order of two consecutive factors. The 
difference M — M' , which is the sum of the differences M — Mi , Mi — M2 , ■ ■ ■ , Mp — 
M', is therefore again equivalent to a polynomial of lesser degree. 

More generally, the difference of two equipollent polynomials is equivalent to a 
polynomial of lesser degree. We now claim the following. 

Theorem 3.8. In the algebra A any arbitrary polynomial is equivalent to a unique 
regular polynomial. 

Proof. First let us show that this equivalence relation is additive, i.e., if p ^ p' ^ 
and q ^ q' then p + q ^ p' + q' . This is obvious since this is equivalent to p — p' ~ 
and q-q' ^0, and hence {p + q) - {p' + q') ^ {p ~ p') + {q - q') ^ + Q ^ Q. 



Now let P„ be an arbitrary polynomial of degree n; then by Theorem 3.3 (ii) 
P„ is equipollent to a unique regular polynomial of the same degree n, and by 
the remark preceding this theorem, P„ — P^ is equivalent to a polynomial P„_i of 
lesser degree (which we may assume, without loss of generality, of degree n — 1). 
Therefore, P„ ~ P^ + Pn-i, and P„_i is in turn equipollent to a regular polynomial 
Pn-i, and hence 

Pn ^ Pn+ Pn-l = P,' + Pn-l + (Pfi-1 - P'n-l) ^ P'n + P'n-1 + (-ffi-2), • ■ ■ , 



^Poincare used the symbol = to denote this equivalence relation. To avoid confusion we adopt 
the more conventional symbol ~. 



POINCARE'S PROOF OF THE SO-CALLED BIRKHOFF-WITT THEOREM 



17 



and so on; one finally arrives to a polynomial of degree zero which is obviously 
regular. Thus one can conclude that 

where the second member is a regular polynomial. We therefore have a means 
to reduce any polynomial to a regular polynomial by making use of the relations 
( 3.16| ). It remains to find out if this reduction can be done uniquely. 



Since both the equivalence relation ^ and the notion of regular polynomials are 
additive, this problem is equivalent to the following: 

Can a non-identically zero regular polynomial be equivalen t to zero! Or equiva- 



lently, can we find a sum of trinomial products of the form (3^) which is a non- 
identically zero regular polynomial! All sums of such products are indeed equivalent 
to zero and vice- versa. If we define a regular sum to be a sum of trinomial products 
of the form (|3.l[ ) which is also a regular polynomial then the answer (negative) to 
this question (and hence to the question above regarding uniqueness) can be stated 
as follows: 

Lemma 3.9. Every regular sum is identically zero. 



Proof of the lemma. The degree of a trinomial product (3.1) is clearly d'^{P) + 
d°{Q) + 2. Thus we call the degree of a sum S of trinomial products the highest 
of all the degrees of the products in S even though as we shall see when S is a 
regular sum the terms of highest degree in these different products mutually cancel 
each other. 



The trinomial product (3J_) can be considered as the sum of two products, the 
binomial product 

(3.17) P{XY -YX)Q, 

where we call PXYQ the positive monomial and —PYXQ the negative monomial; 
and the product 

(3.18) -P[X,Y]Q, 

which we call the complementary product. 

Thus if S is an arbitrary sum of trinomial products of degree p and of degree 
< p then we can write 

(3.19) S^Sp-Tp + Sp-i-Tp-i + --- + Sk-Tk + --- + S2- T2, 

where Sk, 2 < k < p, is a sum of homogeneous binomial products of degree fc, 
whereas —Tk is the sum of the corresponding complementary products. First ob- 
serve that if 5 is a regular sum then every homogeneous component of S is also 
regular since regularity is graded; in particular Sp is regular. Since Sp is a sum of 
binomial products of degree p of the form PXYQ — PYXQ and since equipollence 
is an additive equivalence relation it follows immediately that Sp is equipollent to 
zero. But zero is a regular polynomial and Theorem |3.3| (ii) implies that two regular 
polynomials cannot be equipollent without being identical, and therefore Sp must 
be identically zero. 

Remark 3.10. From the discussion above it follows that the degree of a regular sum 
as we defined it is actually at least one more than the classical degree of a polynomial 
in A. 
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Thus in particular when 5 is a regular sum of degree 3 (actual degree 2) then 

(3.20) S = S3~^n + S2-T2. 

Since S3 is homogeneous of degree 3, a typical binomial product of S3 must be 
of the form 

{XY-YX)Z or Z{XY-YX). 

Since S3 is regular, hence symmetric, Theorem |3.3| (i) implies that if the binomial 
product {XY — YX)Z = XYZ — YXZ occurs in ^3, all six monomials equipollent 
to XYZ (resp. —YXZ) must occur in S3 with the same coefficient. 

Thus S3 must be a sum of terms of the form 

(3.21) ^{XY -YX)Z -^Z{XY - YX), 

where the sign ^ means that one must sum over the term which is explicitly 
expressed under the sign and the other two terms obtained by cyclically permuting 
the three letters X, Y , Z. Note that one can verify directly from Eq. (3.21) that S3 
is identically zero. It follows from Eq. (3.21) that the sum of the complementary 
products — T3 contains terms of the form 

(3.22) -{^[X,Y]Z-Y,Z[X^Y]) 

Since S2 — T3 is homogeneous of degree two and S is regular, it follows that S2 — T3 is 
also regular, and hence symmetric. Since it contains — [X, Y]Z + Z[X, Y], Theorem 
3.3 (i) again implies that it must contain permutations of these terms with the same 
coefficients, i.e., 

- [X, Y]Z - [Y, X]Z - [Z, Y]X - [X, Z]Y - [Z, X]Y - [Y, Z]X 

+ Z[X, Y] + X[Z, Y] + Y[Z, X] + Z[Y, X] + Y[X, Z] + X[Y, Z], 
which can be regrouped in the following form: 

(3.23) - (J2[X, Y]Z ^ Z[X, Y]) + (^^[X, Y]Z ^[^' ^1; 



using the fact that the bracket product [ , ] is anti-symmetric. From Eqs. ( 3.22 ) 
and ( 3.23 ) it follows that 

(3.24) S2^Y.^X,Y]Z-Y,Z[X,Y] 

and 52 -T3 = 0. Since S2 is a sum of terms of the form WZ-ZW with W = [X, Y] 
it follows that the complementary polynomial is of degree one and is a sum of terms 
of the form [W, Z] . Thus we have 

T2 = Y.[[X,Y],Z], 

where ^ has the same meaning as above. Thus T2 is a sum of terms of the form 

(3.25) [[X, Y],Z] + [[y, Z],X] + [[Z, X],Y]. 



It follows from Eq. ( |2.lD that T2 is a polynomial of first degree which is obviously 
symmetric, and hence regular. Therefore, if T2 is not identically zero, the sum S 
would be a regular polynomial which is not identically zero. 

Therefore, in order that a polynomial can be reduced in a unique fashion to a 
regular polynomial, it is necessary that the expression ( 3.25| ) is identically zero. 
But one recognizes there the Jacobi identities which play such an important role in 
Lie theory. It remains to show that this condition is sufficient. 



POINCARE'S PROOF OF THE SO-CALLED BIRKHOFF-WITT THEOREM 



19 



At this juncture, it is important to make the foUowing remark: 



It follows from Remark 3.10 that we have actually proved that every polynomial 
of degree 0, 1, or 2 is equivalent to a unique regular polynomial of degree 0, 1, or 
2, respectively. 

Now by induction suppose that the lemma has been proven for regular sums of 
degree 1,2,... ,p — 1 and propose to extend it to regular sums of degree p. 

Thus, let S — Sp — Tp + Sp-i — Tp_i + • • • be a sum of trinomial products. Let 
us call Sp — Tp the head (or leading terms) of the sum S. We say that a sum of 
trinomial products form a chain if the negative monomial of each product is equal 
and of opposite sign of the positive monomial of the product that follows. The 
positive monomial of the first product and the negative monomial of the last one 
are called extreme monomials of the chain. Examples of chains: 

Ci : XZ{XY)W - XZ{YX)W - XZ[X, Y]W + X{ZY)XW - X{YZ)XW 
- X[Z, Y]XW + XY{ZX)W - XY{XZ)W - XY[Z, X]W. 

C2 ■■ XZ{XY)W - XZ{YX)W - XZ[X, Y]W + XZY{XW) - XZY{WX) 
- XZY[X, W] + X{ZY)WX - X{YZ)WX - X[Z, Y]WX + XYZ{WX) 
- XYZ{XW) - XYZ[W, X] + XY{ZX)W - XY{XZ)W - XY[Z, X]W. 

Remark 3.11. /< results from the definition that all positive monomials [and hence, 
all negative monomials) of the same chain can only differ by the order of their 
factors. 

A chain is said to be closed if its extreme monomials are equal and of opposite 
sign. If Sp~Tp is a closed chain of trinomial products it is clear that Sp is identically 
zero since the positive and negative monomials cancel each other two by two. 

We have seen that if 5 is a regular sum, Sp is identically zero. It follows therefore 
that the head of a regular sum must always consist of one or more closed chains. 

If two chains have the same extreme monomials, then their difference is a closed 
chain. For example, 

C1-C2: XYZ{XW) - XYZ{WX) - XYZ[X, W] 

+ X{YZ)WX - X{ZY)WX - X[Y, Z]WX + XZYWX 

- XYZXW + XZY[X, W] - X[Z, Y]XW. 

We shall use this remark to show that a closed chain can always be decomposed 
in many ways into two or more closed chains. An arbitrary closed chain can be 
in many ways regarded as the difference of two chains C and C" having the same 
extreme monomials. Let C" be a third chain having the same extreme monomials, 
then the chain C — C" is then decomposed into two other closed chains C — C" and 
C" - C". 

Now remark first that if a regular sum of degree p is identically zero, it must be 
the same for all regular sums of degree p which have the same head. The difference of 
these two sums will be indeed a regular sum of degree p— 1 which will be identically 
zero according to our inductive hypothesis. Therefore it suffices for us to form all 
closed chains of degree p and prove that each one of them can be considered as the 
head of an identically zero regular sum. Indeed, each regular sum S of order p has 
as head one or more of those closed chains. Let S' be one of those closed chains, 
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then if we show that there exists an identically zero regular sum having S' as head, 

it follows immediately from the remark above that S must be identically zero. Thus 
by induction we suppose this statement holds for all closed chains of degree < p — 1 
and we will show that it is true for all closed chains of degree p. 

To establish this assertion, we are going to decompose the closed chain in question 
into several closed chains. It is clear that it suffices to prove the proposition for 
each component. 

A chain is called simple of the first kind if the first factor of all of its monomials 
either positive or negative is everywhere the same. A chain is called simple of the 
second kind if the last factor of all of its monomials either positive or negative is 
everywhere the same. Moreover, a simple chain can be either closed or open (not 
closed). 

Since p is larger than three, it is clear that every closed chain can be regarded as 
the sum of a certain number of simple chains, alternatively of the first and second 
kinds or vice- versa. 

Thus let 5 be a closed chain, Ci, C2, . . . , C„ be simple chains of the first kind, 
, Cj, . . . , be simple chains of the second kind, such that 

S = Ci + C[ + C2 + C!, + --- + Cn + C'^, 

the extreme negative monomial of each chain being, of course, equal and of opposite 
sign to the extreme positive monomial of the next chain, and the extreme negative 
monomial of being equal and of opposite sign to the extreme positive monomial 
of Ci. Note that, a priori, Ci or C^, can be the zero chain, for example, if S starts 

with XYQ — YXQ , where d^{Q) > 1; then Ci — 0, but then we can consider 

Ci as the zero simple closed chain of the form XYQ — ■ ■ ■ — XYQ, and similarly 

for c;. 

Let X be the first factor of all the monomials of Ci , Z the last factor of all the 
monomials of C( , Y the first factor of all the monomials of C2 , and T the last factor 
of all the monomials of (we do not exclude the case where two of the operators 
X, y, T are identical). 

Let C" be a simple chain of the second kind having its extreme positive monomial 
equal and of opposite sign to the extreme negative monomial of C2, and in which 
all monomials have the last factor equal to T, and moreover, the extreme negative 
monomial has X as its first factor. 

Let C" be a simple chain of the first kind such that all monomials in it have 
X as the first factor, and moreover, the extreme monomials are respectively equal 
and of opposite signs to the extreme negative monomial of C" and to the extreme 
positive monomial of Ci . 

Schematically we have the following diagram: 

(3.26) XU XUZ^-XUZ YUZ + YUZ YUT 

'• V ' ^ V ' ^ V ' 

Ci C[ C2 

+ YUT UT + UT XUT + XUT XU, 

^ V ' ^ V ' ^ ~, ' 

C'^ C" C" 

where each box □ represents certain unspecified monomial which does not have any 
effect on our discussion. 
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Thus the closed chain S is decomposed into a sum of two closed chains as S' + S", 
where 

S' = {C" + Ci) + C[+C2 + iC!,+C"), 
S" ^ -C" + C3 + • • • + C„ - C". 

The closed chain 5" contains only four simple chains, since (C"" + Ci) and (C^ + C ) 
are simple chains; S" contains two simple chains less than S. Continuing this 
scheme we end up decomposing S into closed components which consist of only 
four simple chains. Thus it suffices to consider the case of closed chains S formed 
by four simple chains as, for example, the form S' . 

Therefore, it follows from ( ^.26 ) that the extreme positive monomials of the four 



chains that form S' have respectively for first and last factors: 

for C" + Ci X and T, 

for C[ X and Z, 

for C2 Y and Z, 

for C'2 + C" Y and T. 

Let MiM'-y, M2, M'2 denote these four monomials. 

^From Remark 3.11 it follows that all these monomials are equipollent to each 



other and are equipollent to a certain monomial which we will call XYPZT. Set 

Qi = XYPZT, Q[ = XYPTZ, 
Q2 = YXPTZ, Q'2 = YXPZT. 

We are going to construct a series of simple chains which will constitute a de- 
composition of S' as follows: 



Name of the chain Extreme positive monomial Extreme negative monomial 



C" + Ci 


Ml = XUT 


-M{ = -xnz 


C[ 


M[ 


-M2 = -YUZ 


C2 


M2 


-M^ = -YDT 


C'2 + C" 




-Ml 


Di 


Ml 


~Qi 


D'r 


M[ 


-Q'l 


D2 


M2 


-Q2 


D'2 




-Q2 


El 


Qi 


-Q'l 


E[ 


Q'l 


-Q2 


E2 


O2 


-Q'2 


E'2 


O2 


-Qi 



We can suppose that every monomial of the chain Di has as first factor X and last 
factor T; thus Di is both a simple chain of the first and second kind, and similarly 
for other D and D' chains. Furthermore, we can suppose that the E and E' chains 
are reduced to a single trinomial product, for example. 



El = XYP{ZT -TZ - [Z, T]). 
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The closed chain S' = {C" + Ci) + C{ + C2 + {C2 + C") can be decomposed 
into five closed chains as follows: 





= (Mi 


M[) + M[--- 


-Q'l 


+ XYP{TZ 


-ZT-[T,Z]) + Qi 

— V ' ' 


• • • - Ml, 

V ' 




C"'+Ci D[ 






-El 


-Di 




= Mi- 


■■-M2 + M2 




- (YX - XY 


- [Y,X])PTZ+Q[- 


■■-M[, 






Ci D2 






-V ■' ^ 

-E'l 


-V ■' 

-D'l 


U2 


= M2- 


• • - + M2 


Q'2^ 


- YXP{ZT - 


TZ-[Z,T]) + Q2-- 

V ' ^ 


■-M2, 






-V ' ^ V 

C2 D'^ 








-D2 




= M'2- 


Ml + Ml 

-V ' " «— 


0i4 


- {XY - YX 


-[X,Y])PZT + Q'^- 
-V ' ^ 


• • - M^, 



C2+C" Di -E'^ -D'2 



V = XYP{ZT -TZ- [Z, T]) + {XY -YX - [X, Y])PTZ 

^ V ' ^ V ' 

El E[ 

+ {YXP{TZ -ZT- [T, Z]) + {YX - XY - [Y, X])PZT. 

^ V ' ^ V ' 

E2 E'^ 

Clearly, Ui + U[ + U2 + U!2 + V = {€"' + Ci) + C{ + C2 + (C^ + C") = S. We must 
show that each one of the five closed chains above is the head of an identically zero 
regular sum. The first four chains are of the form 

U,=XH,, U[=H[Z, U2 = YH2, U!2 = H'2T, 

where each chain Hi, H[, H2, i?2 a closed chain of degree p — 1; therefore by 
induction, each is the head of an identically zero regular sum. It follows that Ui, 
U{, U2, and U2 are identically zero, and therefore each of them can be considered 
as the head of an identically zero regular sum of degree p. 
Finally for V, it is the head of the sums 

XYP{ZT ^TZ- \Z, T]) + {XY -YX - [X, Y])PTZ 

- YXP{-TZ + ZT- [Z, T]) - {-YX + XY - [X, Y])PZT 

- [X, Y]P{ZT -TZ- [Z, T]) - {XY -YX- [X, Y\)P[T, Z], 

which can be expanded and rearranged as 

XYPZT - XYPTZ + XYPTZ - YXPTZ + YXPTZ 

- YXPZT + YXPZT - XYPZT - XYP[Z, T] - [X, Y]PTZ 
+ YXP[Z, T] + [X, Y]PZT - [X, Y]PZT + [X, Y]PTZ 

- XYP[T, Z] + YXP[T, Z] + [X, Y]P[Z, T] + [X, Y]P[T, Z], 

which is identically zero. Since is a regular sum, it follows that V is the head of 
an identically zero regular sum of degree p. 

Note that our analysis remains unchanged when two or more of the operators 
X, Y, Z, T are identical. For example, when X = Y, then E[ = E'2 = 0, and 
we set Qi = Q'2 = X{XP)ZT = XP'ZT, Q2 = Q[ = X{XP)TZ = XP'TZ. 
The definition of the various chains remains the same, and we can immediately 
verify that V is identically zero. Finally, in order that this proof is valid, p must 
be greater than three since the chain V must have at least four factors. But this 
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was the assumption in our inductive hypothesis. Thus the proof of Lemma 3.9, and 



hence of Theorem 3.8, is achieved 



Corollary 3.12. (The so-called Birkhoff-Witt Theorem.) LetU{C) denote the uni- 
versal enveloping algebra of a Lie algebra over a (commutative) field of character- 
istic zero. If {Xi, . . . , Xn} is a basis of C and if (a) — (ai,... denotes 
an n-tuple of integers > 0, set = X"' ■■■X^",X^°'^ = Sym(X"), |(a)| = 

ai + • • • + a„. Then the set {X^"^}^^,), for all (a) such that \{a)\ > 0, forms a 
vector space basis for U{C). Moreover, any set of elements of IA{C) of the form 
{Xi-^ ■ ■ -Xi^, 1 < ij < n,l < j < m,m > 0}, where each Xi-^ ■ ■ ■ Xi^ is a repre- 
sentative of an equipollence class X^"^ for all distinct (a) such that \{a)\ > 0, is a 
basis ofU{C); in particular, the set of ordered monomials |(q;)| > 0} forms 

a basis for IA{C). 

Proof. ^From our discussion pertaining to Poincare's discovery of the universal 
enveloping algebra of a Lie algebra, it follows that the quotient algebra of the 
polynomial algebra A modulo the equivalence relation ~ can be regarded as the 
universal enveloping algebra of the Lie algebra C generated by Xi, . . . , Xn- 

Define a map from A to TZ, the vector space of all regular polynomials in A, by 
assigning to each polynomial A in A the unique regular polynomial A equivalent 



to A as defined by Theorem |3.8| . From the proof of Theorem |3.8| it follows that 
this map is linear, and that it is surjective since the unique regular polynomial 
equivalent to a given regular polynomial is itself. The kernel of this homomorphism 
is, by the definition of the equivalence relation the vector space spanned by 
all trinomials of the form P{XY — YX — [X,Y])Q for arbitrary P and Q in A. 
Let I denote this kernel, then obviously I is a two-sided ideal of A. It follows 
from the first isomorphism theorem that A/T is isomorphic to TZ as vector spaces. 
^From the remark following Lemma |3.5^ it follows that the set |(a)| > 0, 

forms a basis for TZ, and hence a basis for U{C) ^ A/X via the isomorphism above. 
Note that we have shown following Lemma that T' is isomorphic to TZ via 
the isomorphism <I>, therefore T^ is isomorphic to U{C). For the second part of 
the theorem, we remark that it follows from Theorem that each X-i^ ■ ■ ■ Xi^^ 
is equipollent to a unique regular polynomial X*^") for some [a) e Am. Thus it 
suffices to consider the set {X'^°'\ |(q!)| > 0}. We also remark that it suffices to 
show that the set {X'^°'\ (a) € Km} is linearly independent in U{£,) for all m > 0, 
since U (£) is a filtered algebra. From the proof of Theorem |3.8| it follows that 
each X''"',(a) G A„j, is equivalent to a unique regular polynomial of the form 
+P{a), where is a regular polynomial of degree < to. Thus if for some 
scalars A(q) € K such that X](q)£A„ '^(q)-'^^"^ is zero 'm.U{L) (i.e., equivalent to 0), 
then since the equivalence relation ^ is linear it follows that the regular polynomial 
S(a)eA ^a{X'^"'^ + Pa) is equivalent to 0. It follows from Theorem 3.8 (or more 



precisely Lemma 3.9) that 



E 



Q.)-^ (a) 



(q)GAt^ (a)eATi 
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must be identically zero. Since d"{X^°'^) = m and dP{P(^a)) < ™ for all (a) G A^, 
it follows that 

(a)eA™ 

and hence by the first part of the proof of the theorem, it follows that Xf^) — for 
all (a) e Ajn- This completes the proof of the theorem. | 

Remark 3.13. We note that throughout this section the basis for the Lie algebra 
C can be the infinite set {Xi, . . . , X„, . . . }. The tensor algebra T remains isomor- 
phic to the non-commutati ve algebra A of polynom ials in infinitely many variables 



Xi,... ,X„,... (see, e.g., fSchwartz 1998 (1975)i Prop. (2.4), p. 40]), and every 



argument remains the same. As a special case, let £„ (resp. An) denote the Lie 
algebra (resp. the non-commutative polynomial algebra) generated by Xi, . . . , Xn- 
Let C (resp. ,4) denote the inductive limit of (resp. An)', then all theorems in 
this section can be easily generalized. For example, let 

d 

Xij = Xi- — , I <t,j < n; then 

[Xij,Xki] ~ SjkXii — SliXkj, 

and {Xij ,l<i,j<n} generates the Lie algebra gin and the associative polynomial 
algebra An, respectively. By letting n ^ oo we get the Lie algebra qIoo and Aoo, 
respectively. Another example is the Heisenberg Lie algebra 7i„ spanned by the 
vector fields 

V2 / d \ ^ V2 f d 



and R — il, where / is the identity operator. Then we have the commutation 
relations 

[Pj,Pk] = [Qj,Qk] = 0, [Pj,R] = [Q„ i?] = 0, [Pj,Qk] = -SjkR, l<j,k<n. 

Let An denote the algebra of non-commutative polynomials in the vector fields 
Pi, . . . , Pn, Qi, ■ ■ ■ , Qn and R. Then when n ^ oo we obviously have the general- 
ization of the theorems in this section to the Heisenberg Lie algebra Tioc , and hence 

to v4oo ■ 

4. Conclusion 

A. Einstein said "A good idea is very rare" . We reckon that there are, at the 



very least, three "good" ideas in [Poincare 1900 1, namely, the universal enveloping 



algebra of a Lie algebra, the symmetrization map, and the proof of the so-called 
Birkhoff-Witt theorem. And in our opinion, none of these were properly appreci- 
ated and recognized. We have gone to great length and sometimes with repetitive 
arguments to try to convince the mathematics community of what a great feat 
Poincare has achieved in [ Poincare 1900[ . But even if we fail, we would be much 



wiser by our reading a masterpiece by a great master. 

We leave the reader with the following thought of Paul Painleve, another great 
master, in the obituary written for the newspaper Le Temps (and reprinted in 
[Painleve 1921 ), on July 18, 1912, the day after Poincare died: 
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"Henri Poincare n'a pas ete seulement un grand createur dans les sciences posi- 
tives. II a ete un grand philosophe et un grand ecrivain. Certains de ses aphorismes 
font songer a Pascal: 'La pensee n'est qu'un eclair entre deux longues nuits, mais 
c'est cet eclair qui est tout'. Son style traduit la demarche meme de sa pensee: des 
formules breves et saisissantes, parodoxales parfois quand on les isole, reunies par 
des explications hatives, qui rejettent des details faciles pour ne dire que I'essentiel. 
C'est pourquoi des critiques superficiels lui out reproche d'etre 'decousu': la verite, 
c'est que, sans education scientifique prealable, une telle demarche logique est dif- 
ficile a egaler: le lion ne fait pas des enjambees de souris"^. 

Acknowledgement. The authors wish to thank the editors of the Revue d'Histoire 
des Mathematiques whose comments helped to improve the overall quality of this 
article. They also wish to thank the technical staff of the Department of Mathe- 
matics of the University of Iowa, Mr. Brian Treadway and Ms. Cymie Wehr, for 
their impeccable job and for their patience in preparing this manuscript. 



This can be roughly translated as follows: "Henri Poincare was not only a great creator in 
the positive sciences. He was a great philosopher and a great writer. Some of his aphorisms 
make us think of Pascal: 'Thought is just a flash of lightning in the middle of two long nights, 
but it is this lightning that is everything'. His style reflects the very development of his thought: 
brief and startling formulae, sometimes paradoxical when one isolates them, joined together by 
hasty explanations, which reject easy details in order just to express the essential. That is why 
superficial critiques reproach him as being 'incoherent': the truth is that, without prerequisite 
education, such logical development is difficult to match: the lion does not take a mouse's paces." 
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